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1^,,^ Abstract. Cylindrical algebraic decomposition (CAD) is an important 

j^ tool for the study of real algebraic geometry with many applications both 

^H within mathematics and elsewhere. It is known to have doubly exponen- 

^H tial complexity in the number of variables in the worst case, but the 

^+ actual computation time can vary greatly. It is possible to offer different 

^S| formulations for a given problem leading to great differences in tractabil- 

ity. In this paper we suggest a new measure for CAD complexity which 

I. ^ ' takes into account the real geometry of the problem. This leads to new 

>-' heuristics for choosing: the variable ordering for a CAD problem, a des- 

'^^ ignated equational constraint, and formulations for truth-table invariant 

C/3 CADs (TTICADs). We then consider the possibility of using Grobner 

O bases to precondition TTICAD and when such formulations constitute 

the creation of a new problem. 

> 

CN 1 Introduction 

(N 

p^ Cylindrical algebraic decomposition (CAD) is a key tool in real algebraic geom- 

• etry both for its original motivation, quantifier elimination (QE) problems [10 

']^ etc.], but also in other applications ranging from robot motion planning |25[ etc. 

^^ to programming with complex functions [13, etc.] and branch cut analysis [17 

,__! etc.]. Decision methods for real closed fields are used in theorem proving [Ts], so 

CAD has much potential here. In particular MetiTarski employs QEPCAD [4] to 
decide statements in special functions using polynomial bounds Ib^^S,- Work 
is ongoing to implement a verified CAD procedure in COQ [9|[22]. 

Since its inception there has been much research on CAD. New types of 
CAD and new algorithms have been developed, offering improved performance 
and functionality. The thesis of this paper is that more attention should now be 
given to how problems are presented to these algorithms. 

How a problem is formulated can be of fundamental importance to algo- 
rithms, rendering simple problems infeasible and vice versa. In this paper we 
take some steps towards better formulation by introducing a new measure of 
CAD complexity and new heuristics for many of the choices required by CAD 
algorithms. We also further explore preconditioning the input via Grobner bases. 



> 

X 



The final publication is available at http://link.springer.com. 



1.1 Background on CAD 

A CAD is a decomposition of M" into cells arranged cylindrically (meaning their 
projections are equal or disjoint) and described by semi-algebraic sets. Tradition- 
ally CADs are produced sign-invariant to a given set of polynomials in n variables 
X, meaning the sign of the polynomials does not vary on the cells. This definition 
was provided by Collins in |10| along with an algorithm which proceeded in two 
main phases. The first, projection, applies a projection operator repeatedly to a 
set of polynomials, each time producing another set of polynomials in one fewer 
variables. Together these sets provide the projection polynomials. The second 
phase, lifting, then builds the CAD incrementally from these polynomials. First 
M is decomposed into cells which are points and intervals corresponding to the 
real roots of the univariate polynomials. Then M^ is decomposed by repeating 
the process over each cell using the bivariate polynomials at a sample point of 
the cell. The output for each cell consists of sections of polynomials (where a 
polynomial vanishes) and sectors (the regions between these). Together these 
form the stack over the cell, and taking the union of these stacks gives the CAD 
of M^. This process is repeated until a CAD of E" is produced. This final CAD 
will have cells ranging in dimension from (single points) to n (full dimensional 
portions of space). The cells of dimension d are referred to as d-cells. 

It has often been noted that such decompositions actually do much more 
work than is required for most applications, motivating theory which consid- 
ers not just polynomials but their origin. For example, partial CAD fl^, etc.] 
avoids unnecessary lifting over a cell if the solution to the QE problem on a 
cell is already apparent. Another example is the use of CAD with equational 
constraints [21j etc.] where sign-invariance is only ensured over the sections of 
a designated equation, thus reducing the number of projection polynomials re- 
quired. It is worth noting that while the lifting stage takes far more resources 
that the projection, improvements of the projection operator have offered great 
benefits. 

Applications often analyse formulae (boolean combinations of polynomial 
equations, inequations and inequalities) by constructing a sign invariant CAD 
for the polynomials involved. However this analyses not only the given problem, 
but any formula built from these polynomials. In p] the authors note that it 
would be preferable to build CADs directly from the formulae and so define a 
Truth Table Invariant CAD (TTICAD) as one which is has invariant truth values 
of various quantifier-free formulae (QFFs) in each cell. In |3 an algorithm was 
produced which efficiently constructed such objects for a wide class of problems 
by utilising the theory of equational constraints. 

1.2 Formulating problems for CAD algorithms 

The TTICAD algorithm in \3, takes as input a sequence of QFFs, each of which is 
a formula with a designated equational constraint (an equation logically implied 
by the formula). It outputs a CAD such that on each cell of the decomposi- 
tion each QFF has constant truth value. The algorithm is more efficient than 



constructing a full sign-invariant CAD for the polynomials in the QFFs. since 
it uses the theory of equational constraints for each QFF to reduce the projec- 
tion polynomials used and hence the number of cells required. Its benefit over 
equational constraints alone is that it may be used for formulae which do not 
have an overall explicit equational constraint (and to greater advantage than the 
use of implicit equational constraints) . Many applications present problems in a 
suitable form for TTICAD, such as problems from branch cut analysis llTi. 

However, it is possible to envision problems where although separate QFFs 
are not imposed they could still lead to more economical CADs. (see Example 
^. Further, we may consider splitting up individual QFFs if more than one equa- 
tional constraint is present. This leads to the question of how best to formulate 
the input to TTICAD, a question which motivated this paper and is answered 
in Section |4j Some of this analysis could equally be applied to the theory of 
equational constraints alone and so this is considered in Section [3J 

In devising heuristics to guide this process we realised that the existing mea- 
sures for predicting CAD complexity could be misled. An important use for these 
is choosing a variable ordering for a CAD; a choice which can make a substantial 
difference to the tractability of problems. We use x ^ y to indicate x is less than 
y in an ordering. In |14 the authors presented measures for CAD complexity 
but none of these consider aspects of the problem sensitive to the domain we 
work in (namely real geometry rather than complex) . In Section p] we suggest a 
simple new measure (the number of zero cells in the induced CAD of M^) leading 
to a new heuristic for use in conjunction with |14| . We demonstrate in general 
it does well at discriminating between variable choices, and for certain problems 
is more accurate than existing heuristics. 

These three topics are all examples of choices for the formulation of problems 
for CAD algorithms. They are presented in the opposite order to which they 
were considered above, as it is more natural for presenting the theory. Problem 
formulation was considered in this conference series last year [27] where the 
idea of preconditioning CAD using Grobner bases was examined. This work is 
continued in Section [5] where we now consider preconditioning TTICAD. 

The tools developed for the formulation of input here lead to the question of 
whether their use is merely an addition to the algorithm or leads to the creation 
of a new problem. This question also arose in [26 where a project collecting 
together a repository of examples for CAD is described. In Section |6] we give our 
thoughts on this along with our conclusions and ideas for future work. 



2 Choosing a Variable Ordering for CAD 
2.1 Effects of variable ordering on CAD 



It is well documented 14 etc.] that the variable ordering used to construct 
a CAD can have a large impact on the number of cells and computation time. 
Example [T]gives a simple illustration. Note that the effect of the variable ordering 
can be far greater than the numbers presented here and can change the feasibility 



of a given problem. In |5 the authors prove there are problems where one variable 
ordering will lead to a CAD with a constant number of cells while another will 
give a number of cells doubly exponential in the number of variables. 

Example 1. Consider the polynomial / := (x— l)(y^ + 1) — 1 whose graph is the 
solid curve in Figure IT] We have two choices of variable ordering, which lead to 
the two different CADs visualised. Each cell is indicated by a sample point (the 
solid circles). Setting y ^ x we obtain a CAD with 3 cells; the curve itself and 
the portions of the plane either side. However, setting x < y leads to a CAD 
with 11 cells; five 2-cells, five 1-cells and one 0-cell. The dotted lines indicate the 
stacks over the 0-cells in the induced CAD of M^. With y ^ a; the CAD of M^ 
had just one cell (the entire real line) while with x < y there are five cells. 

We note that these numbers occur using various CAD algorithms. Indeed, 
for this simple example it is clear that these CADs are both minimal for their 
respective variable orderings, (i.e. there is no other decomposition which could 
have less cells whilst maintaining cylindricity.) 
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Fig. 1. Plots visualising the CADs described in Example [T] 



2.2 Heuristics for choosing variable ordering 



In [14] the authors considered the problem of choosing a variable ordering for 
CAD and QE via CAD. They identified a measure of CAD complexity that was 
correlated to the computation time, number of cells in the CAD and number of 
leaves in a partial CAD. They identified the sum of total degrees of all m,onom,ials 
of all projection polynomials, known as sotd and proposed the heuristic of picking 
the ordering with the lowest sotd. Although the best known heuristic, sotd does 
not always pick the ideal ordering as demonstrated by some experiments in |14] 
and sometimes cannot distinguish between orderings as shown in Example |2] 



Example 2. Consider again the problem from Example [T] Applying any known 
valid projection operator to / gives, with respect to y, the set of projection factors 
{a; — 1, a; — 2}, (arising from the coefficients and discriminant of /). Similarly, 
applying a projection operator with respect to x gives {y^ + 1}. Hence in this 
case both variable orderings have the same sotd. 

We consider why sotd cannot differentiate between the orderings in this case. 
Algebraically, the only visible difference is that one ordering offers two factors of 
degree one while the other offers a single factor of degree two. From Figure IT] we 
see that one noticeable difference between the variable orderings is the number of 
0-cells in the CAD of M^ (the dotted lines) . This is a feature of the real geometry 
of the problem as opposed to properties of the algebraic closure, measured by 
sotd. Investigating examples of this sort we devised a new measure ndrr defined 
to be the number of distinct real roots of the univariate projection polynomials 
and created the associated heuristic of picking the variable ordering with lowest 
ndrr. Considering again the projection factors from Example [2] we see that this 
new heuristic will correctly identify the ordering with the least cells. 

The number of real roots can be identified, for example, using the theory 
of Sturm chains. This extra calculation will likely take more computation time 
than the measuring of degrees required for sotd. However, both costs are usually 
negligible compared to the cost of lifting in the CAD algorithm. 



2.3 Relative merits of the heuristics 

We do not propose ndrr as a replacement for sotd but suggest they are used 
together since both have relative merits. We have already noted that the strength 
of ndrr is its ability to give information on the real geometry of the CAD. Its 
weakness is that it only gives information on the complexity of the univariate 
polynomials, compared to sotd which measures at all levels. If the key differences 
between orderings are not apparent in the univariate polynomials then ndrr is 
of little use, as in Example |3] 

Example 3. Consider the problem of finding necessary and sufficient conditions 
on the coefficients of a quartic polynomial so that it is positive semidefinite: 
eliminate the quantifier in, yx{px'^-\-qx+r+x'^ > 0). This classic QE problem was 
first proposed in 1 18| and was a test case in [l4]. There are six admissible variable 
orderings (since x must always be projected first). In all of these orderings the 
univariate projection factor set will consist of just the single variable of lowest 
order, (either p, q or r) and hence all orderings will have an ndrr of one. However, 
the sotd can distinguish between the orderings as reported in |!l4j. 

Despite the shortcoming of only considering the first level, ndrr should not 
be dismissed as effects at the bottom level can be magnified. We suggest using 
the heuristics in tandem, either using one to break ties between orderings which 
the other cannot discriminate or by taking a combination of the two measures. 



In 111] the authors suggested a second heuristic, a greedy algorithm based 
on sotd. This approach avoided the need to calculate the projection polynomi- 
als for all orderings, instead choosing one variable at a time using the sum of 
total degree of the monomials from those projection polynomials obtained so 
fax. Unfortunately there is not an obvious greedy approach to using ndrr. For 
problems involving many variables (so that calculating the full set of projection 
polynomials for each ordering is infeasible) we should revert to the sotd greedy 
algorithm, perhaps making use of ndrr to break ties. 



2.4 Coupled variables 

It has been noted in |,24] that a class of problems particularly unsuitable for 
sotd is choosing between coupled variables (two variables which are the real 
and imaginary parts of a complex variable). These are used, for example, when 
analysing complex functions by constructing a CAD to decompose the domain 
according to their branch cuts. The ordering of the coupled variables for the 
CAD can affect the efficiency of the algorithm, as in Example [4J 



Example 4- Consider / = \/z^+l where z G C The square root function has a 
branch cut along the negative real axis and so / has branch cuts when 



^{z^ + 1) = .T^ - j/2 + 1 < and 3(2^ + 1) = '2xy = 0, 

where x, y are coupled real variables such that z ~ x + iy. With variable ordering 
X < y we have sotd ~ 8, ndrr — 4 and a CAD with 21 cells while with variable 
ordering y < x we have sotd ~ 8, ndrr = 5 and a CAD with 29 cells. The CADs 
are visualised in Figure [2] using the same techniques as described for Figure [T] 
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Fig. 2. Plots visualising the CADs described in Example HI 



3 Designating Equational Constraints 

An equational constraint is an equation logically implied by a formula. The 
theory of equational constraints is based on the observation that the formula will 
be false for any cell in the CAD where the equation is not satisfied. Hence the 
polynomials forming any other constraints need only be sign invariant over the 
sections of the equational constraint. The observation was first made in 1 11 with 
McCallum providing the first detailed approach in [21_ . Given a problem with an 
equational constraint McCallum suggested a reduced projection operator, which 
will usually result in far fewer projection factors and a simpler CAD. 

This approach has been implemented in Qepcad, a command line interface 
for quantifier elimination through partial CAD ji] . It can also be induced in any 
implementation of TTICAD as discussed in Section [4] The use of equational con- 
straints can offer increased choice for problem formulation beyond that of picking 
a variable order. If a problem has more than one equational constraint then one 
must be designated for use in the algorithm. We propose simple heuristics for 
making this choice based on sotd and ndrr. 

Let P be the McCallum projection operator which, informally, is applied to a 
set of polynomials to produce the coefficients, discriminant and cross resultants. 
The full technical details are available in ll9l and a validated algorithm was given 
in [20] . Note that implementations usually make some trivial simplifications such 
as removal of constants, exclusion of polynomials that are identical to a previous 
entry (up to constant multiple), and only including those coefficients which are 
really necessary for the theory to hold. 

Next, for some equational constraint / let Pf be the reduced projection op- 
erator relative to / described in |2l] . Informally, this consists of the coefficients 
and discriminant of / together with the resultant of / taken with each of the 
other polynomials. This is used for the first projection, reverting to P for subse- 
quent projections. We can then apply the sotd and ndrr measures to the sets of 
projection polynomials as a measure of the complexity of the CADs that would 
be produced. We denote these values by S and N respectively and our heuristics 
are then to choose the equational constraint that minimises these values. 

We ran experiments to test the effectiven ess o f these heuristics using prob- 
lems from the CAD repository described in 26F We selected those problems 



with more than one equational constraint, for which at least one of the choices 
is tractable. The experiments were run in Maple using the ProjectionCAD 
package flG^ and the results are displayed in Table IT] with the cell count, compu- 
tation time and heuristic values given for each problem and choice of equational 
constraint. 

The full details on the problems can be found in the repository. The exam- 
ples each contain two or three equational constraints and the numbering of the 
choices in the table refers to the order the equational constraints are listed in the 
repository. The variable orderings used were those suggested in the repository. 



^ Freely available at http://opvis.bath.ac.uk/29503 



Problem 


EC Choice 1 


EC Choice 2 


EC Choice 3 


Cells Time S N 


Cells Time 


S N 


Cells Time S N 


Intersection A 


657 5.6 61 7 


463 5.1 


64 8 


269 1.3 42 4 


Intersection B 


711 6.3 66 6 


471 5.4 


71 6 


303 1.1 40 5 


Random A 
Random B 


375 2.7 81 9 
1295 21.4 140 13 


435 3.6 


73 8 


425 2.8 80 8 
1437 23.9 158 14 


477 3.8 


84 9 


Sphere-Catastrophe 


285 2.0 61 7 


169 1.0 


59 5 




Arnon84-2 
Hong-90 


39 0.1 54 5 
F - 14 


9 0.0 


47 1 


27 0.1 14 


F 


14 


Cyclic-3 


57 0.3 32 3 


117 0.7 


35 3 


119 0.6 36 4 



Table 1. Comparing the choice of equational constraint for a selection of problems. 
The lowest cell count for each problem is highlighted and the minimal values of the 
heuristics emboldened. 



The time taken to calculate S and N for each problem was always less than 0.05 
seconds and so insignificant to the overall timings. 

For each problem the equational constraint choice resulting in the lowest cell 
count and timing has been highlighted and the minimal values of the heuristics 
emboldened. We can see that for almost all cases both the heuristics point to 
the best choice. However, there is an example (Random A) where both point to 
an incorrect choice. The heuristic based on sotd is more sensitive (because it 
measures at all levels) and as a result is sometimes more effective. For example, 
it picks the appropriate choice for the Cyclic-3 example while the other does not. 

Although the sotd heuristic is superior for all these examples it can be misled 
by examples where the real geometry differs, as in Example [51 

Example 5. Consider the polynomials 

f:^y^~ 2y^x + yx^ + y^ y{y^ - (x + i))(y2 - (x - i)) 
g:=y^- 2y^x + yx^ - y = y{y^ - {x + l)){y^ - {x - 1)) 

along with the formula / = A g = and variable ordering x < y. We could use 
either / or g as an equational constraint when constructing a CAD. We have 



discrim(/) ~ 256(a:^ 



1) 



discrim(.g) = 256(x - lf{x + if 



and so both the projection sets have the same sotd. However, with / as an 
equational constraint the projection set has ndrr= while with g it is 2. The 
CADs of M^ have 3 and 31 cells respectively. 



4 Formulating Input for TTICAD 



Let <P represent a set of QFFs, {4>i\. In [s] the authors define a Truth- Table 
Invariant CAD (TTICAD) as a CAD such that the boolean value of each 



IS 



constant (either true or false) on each cell. Clearly such a CAD is sufficient for 
solving many problems involving the formulae. 

A sign-invariant CAD is also a TTICAD, however, in [s] the authors present 
an algorithm to construct TTICADs more efficiently for the case where each ipi 
has a designated equational constraint fi (an equation logically implied by (j)i). 
They adapt the theory of equational constraints to define a TTICAD projec- 
tion operator and prove a key theorem explaining when it is valid. Informally, 
the TTICAD projection operator produces the union of the application of the 
equational constraints projection operator to each (j)i along with the cross resul- 
tants of all the designated equational constraints, (see 13 for the full technical 
details). As noted in the introduction, TTICAD is more efficient than equational 
constraints alone. 

If there is more than one equational constraint present within a single ipi 
then a choice must be made as to which is designated for use in the algorithm, 
(the others would then be treated as any other constraint). As with choosing 
equational constraints in Section 3 the two different projection sets could be 
calculated and the measures sotd and ndrr taken and used as heuristics, picking 
the choice that leads to the lowest values. 

However, this situation actually offers further choice for problem formula- 
tion than the designation. If 0; had two equational constraints then it would be 
admissible to split this into two QFFs (l)i^i,4'i.2 with one equational constraint 
assigned to each and the other constraints partitioned between them in any man- 
ner. (Admissible because any TTICAD for (l)i^i,4>i.2 is also a TTICAD for 0^.) 
This is a generalisation of the following observation: given a formula cf) with two 
equational constraints a CAD could be constructed using either the traditional 
theory of equational constraints or the TTICAD algorithm applied to two QFFs. 
On the surface it is not clear why the latter option would ever be chosen since 
it would certainly lead to more projection polynomials after the first projec- 
tion. However, a specific equational constraint may have a comparatively large 
number of intersections with another constraint, in which case, while separating 
these into different QFFs would likely increase the number of projection polyno- 
mials it may still reduce the number of cells in the CAD, (since the resultants 
taken would be less complicated leading to fewer projection factors at subsequent 
steps) . Example p] describes a simple problem which could be tackled using the 
theory of equational constraints alone, but for which it is beneficial to split into 
two QFFs and tackle with TTICAD. 

Example 6. Let x ^ y and consider the polynomials 

f2-^{-V-l)-x^+x^ + x, 92-^-y-j + ^, 

and the formula := /^ = A 51 > A /2 = A §2 < 0. 

The polynomials are plotted in Figure [3] where the solid curve is /i , the solid 
line gi, the dashed curve /2 and the dashed line (72- The three figures also contain 
dotted lines indicating the stacks over the 0-cells of the CAD of M^ arising from 
the decomposition of the real line using various CAD algorithms. 



First, if we use the theory of equational constraints (with either /i or /2 as 
the designated equational constraint) then a CAD is constructed which identifies 
all the roots and intersection between the four polynomials except for the inter- 
section of gi and g2 ■ (Note that this would be identified by a full sign-invariant 
CAD). This is visualised by the plot on the left while the plot on the right relates 
to a TTICAD with two QFFs. In this case only three 0-cells are identified, with 
the intersections of 32 with /i and gi with /2 ignored. 

The TTICAD has 31 cells while the CADs produced using equational con- 
straints both have 39 cells. The TTICAD projection set has an sotd of 26 and 
an ndrr of 3 while each of the CADs produced using equational constraints have 
projection sets with values of 30 and 6 for sotd and ndrr. 
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Fig. 3. Plots visualising the induced CADs of R^ described in Example ml 



As suggested by Example [6] we propose using the measures sotd and ndrr 
applied to the set of projection polynomials as heuristics for picking an approach. 
We can apply these with the TTICAD projection operator for deciding if it would 
be beneficial to split QFFs. This can also be used for choosing whether to use 
TTICAD instead of equational constraints alone, since applying the TTICAD 
algorithm from [s] on a single QFF is equivalent to creating a CAD invariant 
with respect to an equational constraint. 

We may also consider whether it is possible to combine any QFFs. If the 
formulae were joined by conjunction then it would be permitted and probably 
beneficial but we would then need to choose which equational constraint to 
designate. Formulae joined by disjunction could also be combined if they share an 
equational constraint, (with that becoming the designated choice in the combined 
formula) . Such a situation is common for the application to branch cut analysis 
since many branch cuts come in pairs which lie on different portions of the same 
curve. However, upon inspection of the projection operators, we see that such 
a merger would not change the set of projection factors in the case where the 
shared equational constraint is the designated one for each formula. Note, if 
the shared equational constraint is not designated in both then the only way to 
merge would be by changing designation. 



When considering whether to spht and which equational constraint to desig- 
nate the number of possible formulations increases quickly. Hence we propose a 
method for TTICAD QFF formulation, making the choices one QFF at a time. 
Given a list ^ of QFFs (quantifier free formulae): 

(1) Take the disjunction of the QFFs and put that formula into disjunctive 
normal form, \/ (pi so that each 0; is a conjunction of atomic formulae. 

(2) Consider each (pi in turn and let rrii be the number of equational constraints. 

— If mi =0 then S is not suitable for the TTICAD algorithm of js], 
(although we anticipate that it could be adapted to include such cases). 

— If TOi = 1 then the sole equational constraint is designated trivially. 

— If mi > 1 then we consider all the possible partitions of the formula 
in (pi into sub QFFs with at least one equational constraint each, and 
all the different designations of equational constraint within those sub- 
QFFs with more than one. Choose a partition and designation for this 
clause according to the heuristics based on sotd and ndrr applied to the 
projections polynomials from the clause. 

(3) Let ^ be the hst of new QFFs, (pi, and the input to TTICAD. 

5 Using Grobner Bases to Precondition TTICAD QFFs 

Recall that for an ideal, / C K[x], a Grobner basis (for a given monomial or- 
dering) is a polynomial basis of / such that {\in{g) \ g e G} is also a basis 
for {lm(/) I / e /}. In [t] experiments were conducted to see if Grobner basis 
techniques could precondition problems effectively for CAD. Given a problem: 

a purely lexicographical Grobner basis {/i}*^i for the fi, (taken with respect 
to the same variable ordering as the CAD), could take their place to form an 
equivalent sentence: 

^:=ALi/.(x) = 0. 

Initial results suggested that this preconditioning can be hugely beneficial in 
certain cases, but may be disadvantageous in others. 

In 127 this idea was considered in greater depth. A larger base of problems 
was tested and the idea extended to include Grobner reduction. Given a problem: 

^ := (A:ii /. W = 0) A (A£i ft(x) *. 0) , *, e {=, ^, >, <}, 

you can first compute {fi}i^i followed by reducing the gi with respect to the fi 
to obtain {gi}l'^i. Then the following sentence will be equivalent to f/): 

'/':=(A*Li/»(x) = 0)A(A-Li5.(x)*.0). 

Experimentation showed that this Grobner preconditioning can be highly 
beneficial with respect to both computation time and cell count, however the 




Fig. 4. Plot of the functions described in Example W\ 



effect is not universal. To identify when preconditioning is beneficial a simple 
metric was posited and shown to be a good indicator. The quantity TNoI (total 
number of indeterminates) for a set of polynomials F is simply defined to be the 
sum of the number of variables present in each polynomial in F. In all testing 
carried out (both for [27] and henceforth) if the produced Grobner basis has a 
lower TNoI than the original set of polynomials then preconditioning is beneficial 
for sign-invariant CAD (the converse is not always true). 

A natural question is whether Grobner preconditioning can be adapted for 
TTICAD. This is possible by performing the Grobner preconditioning on the 
individual QFFs. There is a necessity, however, for a problem to be suitably 
complicated for this preconditioning to work: each QFF must have multiple 
equational constraints amenable to the creation of a Grobner Basis. This required 
complexity means there are few examples in the literature which are suitable and 
tractable for experimentation. We demonstrate the power of combining these two 
techniques through a worked example. 

Example 7. Consider the polynomials 



/l.l 


^ x^ + y'^ - 


-1, 


h,i 


= {x^Af + {y-lf- 


-1, 


/l,2 


— x^ + y^ - 


-1, 


/2,2 


^{x~Af + {y-lf- 


-1, 


91 


= xy-\, 




32 


= (x-4)(2/-l)-i 





and the formula [/i,i = A /i,2 = A 51 > 0] V [/2a = A /2,2 = A 52 > 0]. 

The polynomials are plotted in Figure |4] where the solid curves represent 
/i,i, fi,2,9i, and the dashed curves f2,i, 12.2,92- 

We will consider both variable orderings: y ^ x and x ^ y. We can compute 
full CADs for this problem, with 725 and 657 cells for the respective orderings. 
If we use TTICAD to tackle the problem then there are four possible two-QFF 
formulations, (splitting QFFs is not beneficial for this problem). The four for- 
mulations are described in the second column of Table [21 



Order 


Full CAD 


TTI CAD 






TTI+Cro CAD | 


Cells Time 


Eq Const 


Cells 


Time 


S 


N 


Eq Const 


Cells Time 


S N 


y -< x 


725 22.802 


/l,l./2,l 
./l,l>,f2,2 


153 

111 


0.818 
0.752 


62 

94 


12 
10 


/l,l,/2.1 


27 0.095 


37 3 


fi,i,h,2 


47 0.361 


50 5 






/l,2, /2,1 


121 


0.732 


85 


9 


fl,l,f2,3 
fl,2,f2,l 
fl,2,f2,2 


93 0.257 
47 0.151 
83 0.329 


50 9 
47 5 
63 7 


/l,2,/2,2 


75 


0.840 


99 


7 


























/l,2,/2,3 


145 0.768 


81 11 
















/l,3,/2,l 


95 0.263 


46 10 
















/l,3,/2,2 


151 0.712 


80 12 
















/l,3,/2,3 


209 0.980 


62 16 


X -<y 


657 22.029 


.fl,l,.f2,l 
f 1,1, f 2,2 


125 
117 


0.676 
0.792 


65 

96 


14 
11 


/l,l,/2,l 


29 0.085 


39 4 


/l,l,/2,2 


53 0.144 


52 6 






fl,2,f2,l 


117 


0.728 


88 


11 


/l,l,/2,3 
/l,2,/2,l 
h,2,h,2 


97 0.307 
53 0.146 
93 0.332 


53 97 
49 6 
65 8 


fl,2, f2,2 


85 


0.650 


101 


8 


























/l,2,/2,3 


149 0.782 


81 13 
















/l,3,/2,l 


97 0.248 


48 11 
















/l,3,/2,2 


149 0.798 


82 13 
















/l,3,,/"2,3 


165 1.061 


65 18 



Table 2. Experimental results relating to Example IT] The lowest cell counts are high- 
lighted and the minimal values of the heuristics emboldened. 



We can apply Grobner preconditioning to both QFFs separately, computing 
a Grobner basis, with respect to the compatible ordering, of {/i,i, fi,2}- For both 
QFFs and both variable orderings three polynomials are produced. We denote 
them by {/i,i, /i,2, /i.s} (note the polynomials differ depending on the variable 
ordering). The algorithm used to compute these bases gives the polynomials in 
decreasing order of leading monomials with respect to the order used to compute 
the basis (purely lexicographical). 

Table |2] shows that the addition of Grobner techniques to TTICAD can pro- 
duce significant reductions: a drop from 153 cells in 0.8s to 27 cells in under 0.1s 
(including the time required to compute the Grobner bases) . As discussed in [27] , 
preconditioning is not always beneficial, as evident from the handful of cases that 
produce more cells than TTICAD alone. As with Table[l]we have highlighted the 
examples with lowest cell count and emboldened the lowest heuristic. Looking 
at the values of S and N we see that for this example ndrr is the best measure 
to use. 

In |27) TNoI was used to predict whether preconditioning by Grobner Basis 
would be beneficial. In this example TNoI is increased in both orderings by 
taking a basis, which correctly predicts a bigger full CAD after preconditioning. 
However, TNoI does not take into account the added subtlety of TTICAD (as 
shown by the huge benefit above). 



6 Conclusions and Future Work 

In this paper we have considered various issues based around the formulation of 
input for CAD algorithms. We have revisited the classic question of choosing the 
variable ordering, proposing a new measure of CAD complexity ndrr to com- 
plement the existing sotd measure. We then used these measures as heuristics 
for the problem of designating equational constraints and QFF formulation for 
TTICAD. Finally we considered the effect of preconditioning by Grobner bases. 

It is important to note that these are just heuristics and, as such, can be 
misleading for certain examples. Although the experimental results in Section |3] 
suggest sotd is a finer heuristic than ndrr we have demonstrated that there are 
examples when ndrr performs better, not just Example Is] which was contrived 
for the purpose but also Example [7] introduced for the work on Grobner bases. 

These issues have been treated individually but of course they intersect. For 
example it is also necessary to pick a variable ordering for TTICAD. This choice 
will need to made before employing the method for choosing QFF formulation 
described in Section |4] However, the optimal choice of variable ordering for one 
QFF formulation may not be optimal for another! For example, the TTICAD 
formulation with two QFFs was the best choice in Example [6] where the variable 
ordering was stated as x ^ y but if we had y ^ x then a single QFF is superior. 

The idea of combining TTICAD with Grobner preconditioning (discussed 
in [t], [27]) is shown, by a worked example, to have the potential of being a very 
strong tool. However, this adds even more complication in choosing a formulation 
for the problem. Taken together, all these choices of formulation can become 
combinatorially overwhelming and so methods to reduce this, such as the greedy 
algorithm in 14 or the method at the end of Section B] are of importance. 



All these options for problem formulation motivate the somewhat philosoph- 
ical question of when a reformulation results in a new problem. When a variable 
ordering is imposed by an application (such as projecting quantified variables 
first when using CAD for quantifier elimination) then violating this would clearly 
lead to a new problem while changing the ordering within quantifier blocks could 
be seen to be a optimisation of the algorithm. Similar distinctions could be drawn 
for other issues of formulation. 

Given the significant gains available from problem reformulation it would 
seem that the existing technology could benefit from a redesign to maximise 
the possibility of its use. For example, CAD algorithms could allow the user to 
input the variables is quantifier blocks so that the technology can choose the 
most appropriate ordering that still solves the problem. 

We finish with some ideas for future work on these topics. 

— All the work in this paper has been stated with reference to CAD algorithms 
based on projection and lifting. A quite different approach, CAD via Trian- 
gular Decomposition, has been developed in J8] and implemented as part of 
the core Maple distribution. This constructs a (sometimes quite different) 
sign-invariant CAD by transferring the problem to complex space for solving. 
A key question is how much of the work here transfers to this approach? 



— Can the heuristics for choosing equational constraints also be used for choos- 
ing pivots when using the theory of bi-equational constraints in |6|? 

— Can the ndrr measure be adapted to consider also the real roots of those 
projection polynomials with more than one variable? 

We finish by discussing one of the initial motivations for engaging in work 
on problem formulation: a quantifier elimination problem proving a property of 
Joukowski's transformation. This is the transformation z i— ^ 2('^ "'" z) '^hich is 
used in aerodynamics to create an aerofoil from the unit circle. The fact it is 
bijective on the upper half plane is relatively simple to prove analytically but we 
found the state of the art CAD technology was incapable of producing an answer 
in reasonable time. Then, in a personal communication, Chris Brown described 
how reformulating the problem with a succession of simple logical steps makes 
it amenable to Qepcad, allowing for a solution in a matter of seconds. These 
steps included splitting a disjunction to form two separate problems and the 
(counter-intuitive) removal of quantifiers which block Qepcad's use of equa- 



tional constraints. Further details are given in 13, Sec. Ill] and in the future 
we aim to extend our work on problem formulation to develop techniques to 
automatically render this problem feasible. 
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